Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 10738 |
| Missing cells | 251 |
| Missing cells (%) | 0.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1006.8 KiB |
| Average record size in memory | 96.0 B |
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 8 |
customer_id has a high cardinality: 10738 distinct values | High cardinality |
customer_product_search_score is highly correlated with customer_stay_score | High correlation |
customer_ctr_score is highly correlated with customer_stay_score and 1 other fields | High correlation |
customer_stay_score is highly correlated with customer_product_search_score and 1 other fields | High correlation |
customer_frequency_score is highly correlated with customer_product_variation_score and 2 other fields | High correlation |
customer_product_variation_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_order_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_affinity_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_category is highly correlated with customer_ctr_score | High correlation |
customer_visit_score is highly correlated with customer_ctr_score | High correlation |
customer_ctr_score is highly correlated with customer_visit_score and 2 other fields | High correlation |
customer_stay_score is highly correlated with customer_ctr_score and 1 other fields | High correlation |
customer_frequency_score is highly correlated with customer_product_variation_score and 1 other fields | High correlation |
customer_product_variation_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_order_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_affinity_score is highly correlated with customer_product_variation_score and 1 other fields | High correlation |
customer_category is highly correlated with customer_ctr_score and 1 other fields | High correlation |
customer_ctr_score is highly correlated with customer_stay_score | High correlation |
customer_stay_score is highly correlated with customer_ctr_score | High correlation |
customer_frequency_score is highly correlated with customer_product_variation_score and 2 other fields | High correlation |
customer_product_variation_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_order_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_affinity_score is highly correlated with customer_frequency_score and 2 other fields | High correlation |
customer_visit_score is highly correlated with customer_ctr_score and 3 other fields | High correlation |
customer_product_search_score is highly correlated with customer_ctr_score and 1 other fields | High correlation |
customer_ctr_score is highly correlated with customer_visit_score and 7 other fields | High correlation |
customer_stay_score is highly correlated with customer_visit_score and 6 other fields | High correlation |
customer_frequency_score is highly correlated with customer_ctr_score and 4 other fields | High correlation |
customer_product_variation_score is highly correlated with customer_ctr_score and 5 other fields | High correlation |
customer_order_score is highly correlated with customer_ctr_score and 5 other fields | High correlation |
customer_affinity_score is highly correlated with customer_order_score and 1 other fields | High correlation |
customer_active_segment is highly correlated with customer_visit_score and 2 other fields | High correlation |
X1 is highly correlated with customer_product_variation_score and 2 other fields | High correlation |
customer_category is highly correlated with customer_visit_score and 5 other fields | High correlation |
customer_id is uniformly distributed | Uniform |
customer_id has unique values | Unique |
customer_visit_score has unique values | Unique |
customer_ctr_score has unique values | Unique |
customer_frequency_score has unique values | Unique |
customer_affinity_score has unique values | Unique |
Reproduction
| Analysis started | 2022-01-25 02:46:38.886903 |
|---|---|
| Analysis finished | 2022-01-25 02:47:06.075853 |
| Duration | 27.19 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 10738 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 84.0 KiB |
| csid_1 | 1 |
|---|---|
| csid_7153 | 1 |
| csid_7155 | 1 |
| csid_7156 | 1 |
| csid_7157 | 1 |
| Other values (10733) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.965729186 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 10738 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | csid_1 |
|---|---|
| 2nd row | csid_2 |
| 3rd row | csid_3 |
| 4th row | csid_4 |
| 5th row | csid_5 |
Common Values
| Value | Count | Frequency (%) |
| csid_1 | 1 | < 0.1% |
| csid_7153 | 1 | < 0.1% |
| csid_7155 | 1 | < 0.1% |
| csid_7156 | 1 | < 0.1% |
| csid_7157 | 1 | < 0.1% |
| csid_7158 | 1 | < 0.1% |
| csid_7159 | 1 | < 0.1% |
| csid_7160 | 1 | < 0.1% |
| csid_7161 | 1 | < 0.1% |
| csid_7162 | 1 | < 0.1% |
| Other values (10728) | 10728 |
Length
| Value | Count | Frequency (%) |
| csid_1 | 1 | < 0.1% |
| csid_23 | 1 | < 0.1% |
| csid_42 | 1 | < 0.1% |
| csid_20 | 1 | < 0.1% |
| csid_3 | 1 | < 0.1% |
| csid_4 | 1 | < 0.1% |
| csid_5 | 1 | < 0.1% |
| csid_6 | 1 | < 0.1% |
| csid_7 | 1 | < 0.1% |
| csid_8 | 1 | < 0.1% |
| Other values (10728) | 10728 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 10738 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.06094129 |
| Minimum | 0.5689647667 |
|---|---|
| Maximum | 47.30669098 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | 0.5689647667 |
|---|---|
| 5-th percentile | 7.442481958 |
| Q1 | 13.51802134 |
| median | 18.77410921 |
| Q3 | 24.50171939 |
| 95-th percentile | 31.42445376 |
| Maximum | 47.30669098 |
| Range | 46.73772622 |
| Interquartile range (IQR) | 10.98369805 |
Descriptive statistics
| Standard deviation | 7.419609076 |
|---|---|
| Coefficient of variation (CV) | 0.389257223 |
| Kurtosis | -0.4065214262 |
| Mean | 19.06094129 |
| Median Absolute Deviation (MAD) | 5.439947497 |
| Skewness | 0.1014477924 |
| Sum | 204676.3876 |
| Variance | 55.05059884 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13.16842493 | 1 | < 0.1% |
| 31.58945448 | 1 | < 0.1% |
| 14.7593968 | 1 | < 0.1% |
| 9.136171335 | 1 | < 0.1% |
| 3.790594219 | 1 | < 0.1% |
| 20.37354099 | 1 | < 0.1% |
| 24.3906417 | 1 | < 0.1% |
| 13.51919538 | 1 | < 0.1% |
| 29.63397706 | 1 | < 0.1% |
| 18.9473251 | 1 | < 0.1% |
| Other values (10728) | 10728 |
| Value | Count | Frequency (%) |
| 0.5689647667 | 1 | |
| 0.6441806855 | 1 | |
| 0.6650534717 | 1 | |
| 0.715215517 | 1 | |
| 0.9186268439 | 1 | |
| 0.9418460834 | 1 | |
| 0.9797893876 | 1 | |
| 0.9957208668 | 1 | |
| 1.045409591 | 1 | |
| 1.062848533 | 1 |
| Value | Count | Frequency (%) |
| 47.30669098 | 1 | |
| 43.92674833 | 1 | |
| 43.75726982 | 1 | |
| 42.34256741 | 1 | |
| 42.19495825 | 1 | |
| 41.97600383 | 1 | |
| 41.00608616 | 1 | |
| 40.67115079 | 1 | |
| 40.57284952 | 1 | |
| 40.3873391 | 1 |
| Distinct | 10696 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 42 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.274847153 |
| Minimum | -0.1619399818 |
|---|---|
| Maximum | 16.6382433 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2 |
| Negative (%) | < 0.1% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | -0.1619399818 |
|---|---|
| 5-th percentile | 2.262566301 |
| Q1 | 3.971586843 |
| median | 5.218479286 |
| Q3 | 6.520363539 |
| 95-th percentile | 8.386104872 |
| Maximum | 16.6382433 |
| Range | 16.80018328 |
| Interquartile range (IQR) | 2.548776696 |
Descriptive statistics
| Standard deviation | 1.882558586 |
|---|---|
| Coefficient of variation (CV) | 0.3568934855 |
| Kurtosis | 0.545163275 |
| Mean | 5.274847153 |
| Median Absolute Deviation (MAD) | 1.276309404 |
| Skewness | 0.2892716474 |
| Sum | 56419.76515 |
| Variance | 3.54402683 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.447661691 | 1 | < 0.1% |
| 4.738416608 | 1 | < 0.1% |
| 7.149131419 | 1 | < 0.1% |
| 6.467830973 | 1 | < 0.1% |
| 4.427239253 | 1 | < 0.1% |
| 6.141753694 | 1 | < 0.1% |
| 5.719343629 | 1 | < 0.1% |
| 2.341663508 | 1 | < 0.1% |
| 3.354625648 | 1 | < 0.1% |
| 5.232748727 | 1 | < 0.1% |
| Other values (10686) | 10686 | |
| (Missing) | 42 | 0.4% |
| Value | Count | Frequency (%) |
| -0.1619399818 | 1 | |
| -0.04875713064 | 1 | |
| 0.0644344974 | 1 | |
| 0.08783044642 | 1 | |
| 0.1758818048 | 1 | |
| 0.2199372422 | 1 | |
| 0.2736137469 | 1 | |
| 0.2755627436 | 1 | |
| 0.3516462314 | 1 | |
| 0.3961314961 | 1 |
| Value | Count | Frequency (%) |
| 16.6382433 | 1 | |
| 16.63088664 | 1 | |
| 15.51932408 | 1 | |
| 14.65319495 | 1 | |
| 14.64986837 | 1 | |
| 14.60104289 | 1 | |
| 14.48426809 | 1 | |
| 14.34562558 | 1 | |
| 13.60208564 | 1 | |
| 13.4746232 | 1 |
customer_ctr_score
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 10738 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1759119806 |
| Minimum | -0.5479890838 |
|---|---|
| Maximum | 2.679474242 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2258 |
| Negative (%) | 21.0% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | -0.5479890838 |
|---|---|
| 5-th percentile | -0.07250584954 |
| Q1 | 0.01084001214 |
| median | 0.07407813627 |
| Q3 | 0.1596064355 |
| 95-th percentile | 1.072821684 |
| Maximum | 2.679474242 |
| Range | 3.227463326 |
| Interquartile range (IQR) | 0.1487664233 |
Descriptive statistics
| Standard deviation | 0.3728289383 |
|---|---|
| Coefficient of variation (CV) | 2.119406177 |
| Kurtosis | 10.96033251 |
| Mean | 0.1759119806 |
| Median Absolute Deviation (MAD) | 0.07104629897 |
| Skewness | 3.216021049 |
| Sum | 1888.942848 |
| Variance | 0.1390014172 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -0.07020264289 | 1 | < 0.1% |
| 0.0005898212806 | 1 | < 0.1% |
| 0.04341743782 | 1 | < 0.1% |
| 0.08046201535 | 1 | < 0.1% |
| 1.177369407 | 1 | < 0.1% |
| 0.06061555739 | 1 | < 0.1% |
| 0.07101235966 | 1 | < 0.1% |
| -0.08614276517 | 1 | < 0.1% |
| 0.01564443462 | 1 | < 0.1% |
| 0.04396936788 | 1 | < 0.1% |
| Other values (10728) | 10728 |
| Value | Count | Frequency (%) |
| -0.5479890838 | 1 | |
| -0.5462274631 | 1 | |
| -0.5384683288 | 1 | |
| -0.5339414237 | 1 | |
| -0.5324857093 | 1 | |
| -0.4901018834 | 1 | |
| -0.4865959046 | 1 | |
| -0.4811338877 | 1 | |
| -0.4802283604 | 1 | |
| -0.4797968087 | 1 |
| Value | Count | Frequency (%) |
| 2.679474242 | 1 | |
| 2.571238413 | 1 | |
| 2.57043864 | 1 | |
| 2.510406547 | 1 | |
| 2.390943097 | 1 | |
| 2.38959897 | 1 | |
| 2.384603231 | 1 | |
| 2.383494842 | 1 | |
| 2.375461326 | 1 | |
| 2.360680643 | 1 |
customer_stay_score
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 10701 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 37 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3742300618 |
| Minimum | -0.4624940639 |
|---|---|
| Maximum | 14.70191417 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3962 |
| Negative (%) | 36.9% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | -0.4624940639 |
|---|---|
| 5-th percentile | -0.1002052956 |
| Q1 | -0.02766573337 |
| median | 0.03720079496 |
| Q3 | 0.1790287653 |
| 95-th percentile | 2.441966972 |
| Maximum | 14.70191417 |
| Range | 15.16440824 |
| Interquartile range (IQR) | 0.2066944986 |
Descriptive statistics
| Standard deviation | 1.222030798 |
|---|---|
| Coefficient of variation (CV) | 3.265453321 |
| Kurtosis | 29.79532391 |
| Mean | 0.3742300618 |
| Median Absolute Deviation (MAD) | 0.08274564002 |
| Skewness | 5.008726307 |
| Sum | 4004.635892 |
| Variance | 1.493359272 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -0.1395408793 | 1 | < 0.1% |
| 1.407613358 | 1 | < 0.1% |
| -0.003648490558 | 1 | < 0.1% |
| -5.021905798 × 10-5 | 1 | < 0.1% |
| -0.05240824015 | 1 | < 0.1% |
| 0.04723501864 | 1 | < 0.1% |
| 0.1312583748 | 1 | < 0.1% |
| -0.004738666148 | 1 | < 0.1% |
| -0.01384644227 | 1 | < 0.1% |
| 0.001954872595 | 1 | < 0.1% |
| Other values (10691) | 10691 | |
| (Missing) | 37 | 0.3% |
| Value | Count | Frequency (%) |
| -0.4624940639 | 1 | |
| -0.3895174278 | 1 | |
| -0.3724808355 | 1 | |
| -0.3532560254 | 1 | |
| -0.3492770074 | 1 | |
| -0.3458772237 | 1 | |
| -0.3329127087 | 1 | |
| -0.3303314744 | 1 | |
| -0.3171502534 | 1 | |
| -0.3157417019 | 1 |
| Value | Count | Frequency (%) |
| 14.70191417 | 1 | |
| 14.28113287 | 1 | |
| 13.53972037 | 1 | |
| 12.40849733 | 1 | |
| 11.99354174 | 1 | |
| 11.93459646 | 1 | |
| 11.65400787 | 1 | |
| 11.59071055 | 1 | |
| 11.55208996 | 1 | |
| 11.414581 | 1 |
customer_frequency_score
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 10738 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.376894688 |
| Minimum | 0.02857521051 |
|---|---|
| Maximum | 52.39501392 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | 0.02857521051 |
|---|---|
| 5-th percentile | 0.1542461741 |
| Q1 | 0.3136096545 |
| median | 0.5168299359 |
| Q3 | 1.125379515 |
| 95-th percentile | 14.11275303 |
| Maximum | 52.39501392 |
| Range | 52.36643871 |
| Interquartile range (IQR) | 0.8117698602 |
Descriptive statistics
| Standard deviation | 5.601910934 |
|---|---|
| Coefficient of variation (CV) | 2.356819157 |
| Kurtosis | 19.19443114 |
| Mean | 2.376894688 |
| Median Absolute Deviation (MAD) | 0.2615149293 |
| Skewness | 4.083012882 |
| Sum | 25523.09516 |
| Variance | 31.38140612 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.4369560308 | 1 | < 0.1% |
| 0.3531479719 | 1 | < 0.1% |
| 20.2667777 | 1 | < 0.1% |
| 1.933699216 | 1 | < 0.1% |
| 34.82408972 | 1 | < 0.1% |
| 0.503856912 | 1 | < 0.1% |
| 13.6320122 | 1 | < 0.1% |
| 12.17778256 | 1 | < 0.1% |
| 5.618736672 | 1 | < 0.1% |
| 0.5456667329 | 1 | < 0.1% |
| Other values (10728) | 10728 |
| Value | Count | Frequency (%) |
| 0.02857521051 | 1 | |
| 0.03332008134 | 1 | |
| 0.0355902314 | 1 | |
| 0.03591151507 | 1 | |
| 0.03660536941 | 1 | |
| 0.04070790865 | 1 | |
| 0.04130951588 | 1 | |
| 0.04324217365 | 1 | |
| 0.0461660583 | 1 | |
| 0.04668537241 | 1 |
| Value | Count | Frequency (%) |
| 52.39501392 | 1 | |
| 49.67938001 | 1 | |
| 49.03419464 | 1 | |
| 47.81685008 | 1 | |
| 46.92130903 | 1 | |
| 46.75971585 | 1 | |
| 46.70299502 | 1 | |
| 46.44652515 | 1 | |
| 46.12204014 | 1 | |
| 45.62266754 | 1 |
customer_product_variation_score
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 10692 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 46 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.788179529 |
| Minimum | 2.752836148 |
|---|---|
| Maximum | 18.74383572 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | 2.752836148 |
|---|---|
| 5-th percentile | 3.541244833 |
| Q1 | 4.193234472 |
| median | 4.842574595 |
| Q3 | 6.286400327 |
| 95-th percentile | 11.66568404 |
| Maximum | 18.74383572 |
| Range | 15.99099957 |
| Interquartile range (IQR) | 2.093165855 |
Descriptive statistics
| Standard deviation | 2.531309458 |
|---|---|
| Coefficient of variation (CV) | 0.4373239367 |
| Kurtosis | 3.191873393 |
| Mean | 5.788179529 |
| Median Absolute Deviation (MAD) | 0.8261130491 |
| Skewness | 1.851646948 |
| Sum | 61887.21552 |
| Variance | 6.40752757 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.705760939 | 1 | < 0.1% |
| 5.306686105 | 1 | < 0.1% |
| 10.6925668 | 1 | < 0.1% |
| 9.499833305 | 1 | < 0.1% |
| 10.2482978 | 1 | < 0.1% |
| 5.344186221 | 1 | < 0.1% |
| 10.21667422 | 1 | < 0.1% |
| 13.01691952 | 1 | < 0.1% |
| 12.69785088 | 1 | < 0.1% |
| 10.52207597 | 1 | < 0.1% |
| Other values (10682) | 10682 | |
| (Missing) | 46 | 0.4% |
| Value | Count | Frequency (%) |
| 2.752836148 | 1 | |
| 2.787879879 | 1 | |
| 2.812295598 | 1 | |
| 2.821770078 | 1 | |
| 2.822925888 | 1 | |
| 2.853035872 | 1 | |
| 2.860919714 | 1 | |
| 2.888028761 | 1 | |
| 2.890614443 | 1 | |
| 2.905874869 | 1 |
| Value | Count | Frequency (%) |
| 18.74383572 | 1 | |
| 18.48790753 | 1 | |
| 18.42971169 | 1 | |
| 18.3418688 | 1 | |
| 18.26678114 | 1 | |
| 18.04871108 | 1 | |
| 17.67544028 | 1 | |
| 17.64926437 | 1 | |
| 17.33221461 | 1 | |
| 17.32895438 | 1 |
customer_order_score
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 10672 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 66 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.150070539 |
| Minimum | 0.3633379501 |
|---|---|
| Maximum | 9.090205509 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | 0.3633379501 |
|---|---|
| 5-th percentile | 1.532564209 |
| Q1 | 2.454017385 |
| median | 3.118394172 |
| Q3 | 3.756566397 |
| 95-th percentile | 4.892306349 |
| Maximum | 9.090205509 |
| Range | 8.726867559 |
| Interquartile range (IQR) | 1.302549012 |
Descriptive statistics
| Standard deviation | 1.03541551 |
|---|---|
| Coefficient of variation (CV) | 0.3286959759 |
| Kurtosis | 1.210741347 |
| Mean | 3.150070539 |
| Median Absolute Deviation (MAD) | 0.6528132847 |
| Skewness | 0.5768648974 |
| Sum | 33617.55279 |
| Variance | 1.072085278 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.537985052 | 1 | < 0.1% |
| 3.122050243 | 1 | < 0.1% |
| 1.543632469 | 1 | < 0.1% |
| 1.851068545 | 1 | < 0.1% |
| 1.390036732 | 1 | < 0.1% |
| 3.301753618 | 1 | < 0.1% |
| 2.047554951 | 1 | < 0.1% |
| 2.231378199 | 1 | < 0.1% |
| 2.868131962 | 1 | < 0.1% |
| 1.923436413 | 1 | < 0.1% |
| Other values (10662) | 10662 | |
| (Missing) | 66 | 0.6% |
| Value | Count | Frequency (%) |
| 0.3633379501 | 1 | |
| 0.5371367527 | 1 | |
| 0.5610723909 | 1 | |
| 0.5692797177 | 1 | |
| 0.5997546164 | 1 | |
| 0.6094512156 | 1 | |
| 0.6511893556 | 1 | |
| 0.713192621 | 1 | |
| 0.7212757881 | 1 | |
| 0.7238911627 | 1 |
| Value | Count | Frequency (%) |
| 9.090205509 | 1 | |
| 8.951938748 | 1 | |
| 8.937619861 | 1 | |
| 8.357390523 | 1 | |
| 8.226249469 | 1 | |
| 8.170225052 | 1 | |
| 8.101784031 | 1 | |
| 8.061836027 | 1 | |
| 8.046257167 | 1 | |
| 8.010350996 | 1 |
customer_affinity_score
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 10738 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.06183579 |
| Minimum | -0.4868340563 |
|---|---|
| Maximum | 248.5527547 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 718 |
| Negative (%) | 6.7% |
| Memory size | 84.0 KiB |
Quantile statistics
| Minimum | -0.4868340563 |
|---|---|
| 5-th percentile | -0.08343869615 |
| Q1 | 4.530085389 |
| median | 12.65335707 |
| Q3 | 23.11457668 |
| 95-th percentile | 50.46467763 |
| Maximum | 248.5527547 |
| Range | 249.0395888 |
| Interquartile range (IQR) | 18.58449129 |
Descriptive statistics
| Standard deviation | 18.76269336 |
|---|---|
| Coefficient of variation (CV) | 1.099687841 |
| Kurtosis | 16.85754965 |
| Mean | 17.06183579 |
| Median Absolute Deviation (MAD) | 8.987608416 |
| Skewness | 2.993483837 |
| Sum | 183209.9927 |
| Variance | 352.0386622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.959503221 | 1 | < 0.1% |
| 5.174743812 | 1 | < 0.1% |
| -0.4033399061 | 1 | < 0.1% |
| 1.32251184 | 1 | < 0.1% |
| -0.09010904113 | 1 | < 0.1% |
| 12.870484 | 1 | < 0.1% |
| -0.1571265563 | 1 | < 0.1% |
| -0.2660333716 | 1 | < 0.1% |
| -0.2323887605 | 1 | < 0.1% |
| 7.618839839 | 1 | < 0.1% |
| Other values (10728) | 10728 |
| Value | Count | Frequency (%) |
| -0.4868340563 | 1 | |
| -0.482894096 | 1 | |
| -0.473328527 | 1 | |
| -0.4556253366 | 1 | |
| -0.4542975823 | 1 | |
| -0.4379394688 | 1 | |
| -0.4305667991 | 1 | |
| -0.4291398077 | 1 | |
| -0.4199759236 | 1 | |
| -0.4186963995 | 1 |
| Value | Count | Frequency (%) |
| 248.5527547 | 1 | |
| 246.9369655 | 1 | |
| 218.4587702 | 1 | |
| 206.6697283 | 1 | |
| 198.923264 | 1 | |
| 197.3232906 | 1 | |
| 182.2244816 | 1 | |
| 173.2061211 | 1 | |
| 167.6246271 | 1 | |
| 165.0966305 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23 |
| Missing (%) | 0.2% |
| Memory size | 84.0 KiB |
| C | |
|---|---|
| B | |
| D | |
| AA | 418 |
| A | 412 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.039010733 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | AA |
| 5th row | C |
Common Values
| Value | Count | Frequency (%) |
| C | 4919 | |
| B | 4430 | |
| D | 536 | 5.0% |
| AA | 418 | 3.9% |
| A | 412 | 3.8% |
| (Missing) | 23 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| c | 4919 | |
| b | 4430 | |
| d | 536 | 5.0% |
| aa | 418 | 3.9% |
| a | 412 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 37 |
| Missing (%) | 0.3% |
| Memory size | 84.0 KiB |
| BA | |
|---|---|
| A | |
| F | |
| AA | |
| E | 76 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.572096066 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | A |
| 3rd row | BA |
| 4th row | F |
| 5th row | AA |
Common Values
| Value | Count | Frequency (%) |
| BA | 4511 | |
| A | 2268 | |
| F | 2235 | |
| AA | 1611 | 15.0% |
| E | 76 | 0.7% |
| (Missing) | 37 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| ba | 4511 | |
| a | 2268 | |
| f | 2235 | |
| aa | 1611 | 15.1% |
| e | 76 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 84.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 9443 | |
| 1 | 1295 | 12.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 9443 | |
| 1 | 1295 | 12.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| customer_id | customer_visit_score | customer_product_search_score | customer_ctr_score | customer_stay_score | customer_frequency_score | customer_product_variation_score | customer_order_score | customer_affinity_score | customer_active_segment | X1 | customer_category | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | csid_1 | 13.168425 | 9.447662 | -0.070203 | -0.139541 | 0.436956 | 4.705761 | 2.537985 | 7.959503 | C | F | 0 |
| 1 | csid_2 | 17.092979 | 7.329056 | 0.153298 | -0.102726 | 0.380340 | 4.205138 | 4.193444 | 17.517381 | C | A | 0 |
| 2 | csid_3 | 17.505334 | 5.143676 | 0.106709 | 0.262834 | 0.417648 | 4.479070 | 3.878971 | 12.595155 | C | BA | 0 |
| 3 | csid_4 | 31.423381 | 4.917740 | -0.020226 | -0.100526 | 0.778130 | 5.055535 | 2.708940 | 4.795073 | AA | F | 0 |
| 4 | csid_5 | 11.909502 | 4.237073 | 0.187178 | 0.172891 | 0.162067 | 3.445247 | 3.677360 | 56.636326 | C | AA | 0 |
| 5 | csid_6 | 9.007922 | 7.051568 | 0.161564 | 0.040997 | 0.191935 | 4.209840 | 3.181961 | 18.862680 | C | BA | 0 |
| 6 | csid_7 | 13.707109 | 5.625179 | 0.009634 | -0.019998 | 0.177622 | 4.165093 | 4.689834 | 109.203352 | B | E | 0 |
| 7 | csid_8 | 32.042122 | 3.563568 | -0.050730 | NaN | 0.257060 | 4.366761 | 4.041260 | 24.036321 | AA | A | 0 |
| 8 | csid_9 | 20.434181 | 5.111682 | 0.133922 | 0.036893 | 0.442314 | 4.759516 | 3.407424 | 17.078123 | C | BA | 0 |
| 9 | csid_10 | 13.778214 | 3.829299 | 0.159102 | 0.165818 | 0.558187 | 6.255980 | 3.315462 | 9.443864 | B | BA | 0 |
Last rows
| customer_id | customer_visit_score | customer_product_search_score | customer_ctr_score | customer_stay_score | customer_frequency_score | customer_product_variation_score | customer_order_score | customer_affinity_score | customer_active_segment | X1 | customer_category | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10728 | csid_10729 | 24.772317 | 4.753238 | 0.019578 | -0.070097 | 0.556264 | 4.590020 | 3.126145 | 12.193862 | C | A | 0 |
| 10729 | csid_10730 | 11.657455 | 6.233353 | 0.007517 | -0.016122 | 0.476700 | 4.024655 | 2.727740 | 22.214286 | B | A | 0 |
| 10730 | csid_10731 | 18.793887 | 4.199826 | 0.143747 | 0.219525 | 0.267384 | 3.867731 | 2.893106 | 28.685574 | B | BA | 0 |
| 10731 | csid_10732 | 29.094167 | 6.391500 | -0.051283 | -0.079743 | 0.434865 | 4.791949 | 2.244512 | 6.251333 | B | BA | 0 |
| 10732 | csid_10733 | 14.664036 | 5.341811 | 0.043920 | -0.125090 | 0.269019 | 4.563034 | 3.685176 | 14.066261 | C | A | 0 |
| 10733 | csid_10734 | 23.672615 | 6.701514 | 0.092879 | -0.017332 | 1.210397 | 7.003663 | 3.027084 | 1.952911 | C | BA | 0 |
| 10734 | csid_10735 | 25.673028 | 6.497796 | 0.050216 | -0.047211 | 0.725230 | 5.407507 | 3.104172 | 5.124286 | C | BA | 0 |
| 10735 | csid_10736 | 31.676844 | 7.799880 | 0.062961 | -0.032765 | 0.318118 | 5.598486 | 2.403051 | 21.864188 | A | BA | 0 |
| 10736 | csid_10737 | 28.441780 | 5.588302 | -0.093931 | 0.081586 | 0.132177 | 3.616492 | 4.972243 | 86.969977 | B | AA | 0 |
| 10737 | csid_10738 | 20.663035 | 4.478301 | 0.253165 | 0.381349 | 0.504904 | 4.181092 | 4.469215 | 27.770899 | B | A | 0 |